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Optimizing Data Aggregation for Uplink 
Machine-to-Machine Communication Networks 

Derya Malak, Harpreet S. Dhillon, and Jeffrey G. Andrews 


Abstract 

Machine-to-machine (M2M) communication’s severe power limitations challenge the interconnec¬ 
tivity, access management, and reliable communication of data. In densely deployed M2M networks, 
controlling and aggregating the generated data is critical. We propose an energy efficient data aggre¬ 
gation scheme for a hierarchical M2M network. We develop a coverage probability-based optimal data 
aggregation scheme for M2M devices to minimize the average total energy expenditure per unit area per 
unit time or simply the energy density of an M2M communication network. Our analysis exposes the key 
tradeoffs between the energy density of the M2M network and the coverage characteristics for successive 
and parallel transmission schemes that can be either half-duplex or full-duplex. Comparing the rate and 
energy performances of the transmission models, we observe that successive mode and half-duplex 
parallel mode have better coverage characteristics compared to full-duplex parallel scheme. Simulation 
results show that the uplink coverage characteristics dominate the trend of the energy consumption for 
both successive and parallel schemes. 


I. Introduction 

Machine-to-machine (M2M) applications are rapidly growing, and will become an increasingly 
important source of traffic and revenue on 4G and 5G cellular networks. Unlike video applica¬ 
tions, which are expected to consume around 70% of all wireless data by the end of the decade 
HD, M2M devices will use a comparatively small fraction. However, M2M communication has 
its own challenges. The air interface design for high-data-rate applications may not effectively 
support M2M’s vast number of devices, each usually having only a small amount of data to 
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transmit. Thus, M2M will require sophisticated access management and resource allocation with 
QoS constraints to prevent debilitating random access channel (RACH) congestion |2), and 
to enable link adaptation with low overhead, reduced energy consumption, and efficient control 
channel design 0. Bluetooth (IEEE 802.15.1), Zigbee (IEEE 802.15.4) and WiFi (IEEE 802.11b) 
are a few examples of current technologies that have been used for M2M communication 0 . 
Meanwhile, 3 GPP has been studying M2M communication in its standardization process for 
Long Term Evolution-Advanced (LTE-A). In Release 13, a low-complexity user equipment 
(UE) category for M2M devices is specified, in which a UE has reduced bandwidth and lower 
maximum transmission power, and can operate in all the duplex modes 0|. However, there is 
still no consensus on the general network architecture for large scale M2M communication. 

A. Motivation and Related Work 

Unlike most human generated or consumed traffic, M2M as defined in this paper is charac¬ 
terized by a very large number of small transactions, often from battery powered devices. The 
power and energy optimal uplink design for various access strategies is studied in 0, while an 
optimal uncoordinated strategy to maximize the average throughput for a time-slotted RACH is 
developed in [j6]|. For the small payload sizes relevant for M2M, a strategy that transmits both 
identity and data over the RACH is shown to support significantly more devices compared to the 
conventional approach, where transmissions are scheduled after an initial random-access stage. 

Clustering is a key technique to reduce energy consumption and it increases the scalability and 
lifetime of the network 0, 0. A Low-Energy Adaptive Clustering Hierarchy protocol (LEACH) 
to evenly distribute the energy load among the sensors and enable scalability for dynamic net¬ 
works by incorporating localized coordination is proposed 0, lITOll . A low overhead protocol that 
periodically selects cluster-heads according to the node residual energy to support scalable data 
aggregation is established in lITTll . A Distributed Energy-Efficient Clustering (DEEC) scheme, 
where the cluster-heads are elected by a probability based on the ratio between residual energy 
of each node and the average energy of the network is proposed in lH2l . A data gathering 
mechanism, where clusters closer to the BS have smaller sizes than those farther away from the 
BS to balance the energy consumption is analyzed in [fl3ll . Although these techniques are energy 
efficient, they do not incorporate the coverage characteristics of wireless transmissions. 

A stochastic analysis to determine how many data collectors per area are required to meet 
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the outage probability for a given sensor node density is implemented in Ifl4l . Modulation 
strategies to minimize the total energy to send a given number of bits are analyzed in lTT5l . The 
energy efficiency of a large scale interference limited ad hoc network at physical, medium access 
control (MAC) and routing layers is quantified Ifl6l . Optimal power allocation to minimize the 
total energy consumption in a clustered WSN where sensors cooperatively relay data to nearby 
clusters is studied in ifTTI . The energy-optimal relay distance and the optimal energy to transmit 
a bit successfully over a particular distance are analyzed in lH8l . 

Different control mechanisms to avoid congestion caused by random channel access of M2M 
devices are reviewed m. An adaptive slotted ALOHA scheme for random access control of 
M2M systems with bursty traffic that achieves near-optimal access delay performance is proposed 
ll20l . A comprehensive performance evaluation of the energy efficiency of the random access 
mechanism of LTE and its role for M2M is provided ll2T1 . An energy-efficient uplink design 
for LTE networks in M2M and human-to-human coexistence scenarios that satisfies quality-of- 
service (QoS) requirements is developed |[22l . Similar to 11211 . If22l . we study an energy-efficient 
design for M2M uplink where devices perform multi-hop transmissions. We also incorporate the 
coverage characteristics for different transmission modes using stochastic geometry. 

Because low-power M2M devices may not be able to communicate with the BS directly, 
hierarchical architectures may be necessary. Hence, critical design issues also include optimizing 
hierarchical organization of the devices and energy efficient data aggregation. Although these 
issues have not been studied in the context of M2M, there is prior work on distributed networks 
in the context of wired communications. In ll23l . energy consumption is optimized by studying a 
distributed protocol for stationary ad hoc networks. In ll24ll . a distribution problem which consists 
of subscribers, distribution and concentration points for a wired network model is studied to 
minimize the cost by optimizing the density of distribution points. In lf25l . a hierarchical network 
including sinks, aggregators and sensors is proposed, which yields significant energy savings. 

Hierarchical networks can provide efficient data aggregation in M2M or other power-limited 
systems to enable successful end-to-end transmission. Despite previous research efforts, e.g., Ifl4ll . 
|[8l and lfl8l . to the best of our knowledge, there has been no study focusing on data aggregation 
schemes for M2M networks together with the rate coverage characteristics, especially from an 
energy optimal design perspective. Providing such a study is the main contribution of this paper. 
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B. Contributions and Organization 

A large-scale hierarchical wireless network model for M2M. We propose a new communi¬ 
cation model for the M2M uplink. In Sect. HU we consider a hierarchical scenario to model the 
wireless transmissions where the hierarchical levels describe the multi-hop stages of the model. 
Each level is composed of the transmitter and aggregator processes. Once the transmissions 
of the current hierarchical level are completed, the transmitters are either turned off or kept 
on depending on the transmission scheme used, and in the subsequent hierarchical level, the 
aggregator process forms the new transmitter process. Since each device can act as a transmitter 
or an aggregator in a particular time slot, we model the devices as wireless transceivers. In Sect. 
m the aggregation process is repeated over multiple levels to generate a hierarchical model. We 
show there is an optimal fraction of aggregators that minimizes the overall energy consumption. 

SIR and rate coverage probability. In Sect. [V] we analyze the SIR coverage and rate coverage 
characteristics of the multi-stage transmission process to determine the optimal number of stages. 
We consider two possible transmission techniques: i) successive scheme, where the hierarchical 
levels are not active simultaneously, and ii) parallel scheme, where either all the levels are 
active simultaneously as in full-duplex transmission or where the active levels are interleaved as 
in half-duplex mode. We describe the proposed transmission models in detail in Sect. [VU 

Optimizing the number of multi-hop stages. We propose a general aggregator model for 
power limited M2M devices. Since the M2M devices are power limited and thus range limited, 
multi-hop routing is a feasible strategy rather than direct transmissions. However, in designing 
multi-hop protocols, the number of hops cannot be increased arbitrarily due to the additional 
energy consumption incurred by relays; long-hop routing is a competitive strategy for many 
networks lf26ll . We find an upper and lower bound on the optimal number of hops in Sect. [Vj 

Optimizing the network energy density. We use a generic transceiver scheme to model the 
energy consumption of the devices. For multi-hop transmission models, we find the optimal 
fraction of aggregators independently chosen from the set of devices that minimizes the average 
total energy consumption per unit area, i.e., the mean total energy density, for a fixed payload 
per device of an M2M network. We incorporate the energy consumption of aggregators and 
transceiver circuit components into the energy model. In Sects. [HU and [TV] we optimize the total 
energy density of the transmissions based on a given SIR coverage requirement for the devices. 
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We evaluate the performance of the proposed techniques in Sect. IVII1 and provide a comparison 
in terms of their energy densities and communication rates through numerical investigations. 


II. System Model 

We consider a cellular-based uplink model for M2M communication where the BS and device 
locations are distributed as independent Poisson Point Processes (PPPsJ*] with respective densities 
of Abs and A with A Abs- Each BS has an average coverage area of Agg, and each device 
has a fixed payload of M bits to be transmitted to the BS. We also assume open loop power 
control with maximum transmit power constraint under which the transmit power of a device 
located at distance d from the BS i|] Pr(d) = min{P Tmax , Prd 01 }, where a is the path loss 
exponent, -Pr max is the maximum transmit power constraint and P-p is the received power when 
d < (7 r mf]X /EV) l /M • With this assumption, the average received power at the BS from a device 
in its coverage area and located at distance d from the BS is constant and equal to P R (d) = 
min{P Tmax d -a , Pt}- Assuming N a device^ are scheduled based on TDMA, the uplink SINR 
for any device is SINR = P R (d)(I ic + / oc + NqW)~ 1 , where I 1C and I oc are the intra cell and 
out of cell interference Q, N 0 is the power spectral density and W is the bandwidth. 


A. Data Aggregation and Transmission Model 

Devices transmit data to the BS by aggregating data. The initial device process T is indepen¬ 
dently thinned by probability 7 < 0.5 to generated the aggregator process 'Iy, with density A a 
and the device (transmitter) process ^/ u with density A„, where A = X u + A a . Each transmitter is 
associated to the closest receiving device (aggregator), i.e., the transmitting devices within the 
Voronoi cell of the typical aggregator device will transmit their payloads to that aggregator. 


*PPP is not just plausible, basically a tractable model where the points are randomly and independently scattered in the space, 
and analysis for a typical node is permissible in a homogeneous PPP by Slivnyak’s theorem ( 2 ). 

2 Later in Sect. m in evaluating the SIR-based coverage probability, we also incorporate the small-scale fading into the analysis 
that is assumed to be independent and identically distributed (iid) with unit mean. Therefore, incorporating fading yields the 
same average energy analysis. To keep the notation simple, we do not incorporate fading in Sects. MM and m 

3 N a (random variable) denotes the number of devices in the Voronoi cell of a typical aggregator, and is detailed in Sect. IIIII 

4 The interference is due to simultaneously active aggregator cells. Users within each Voronoi cell are assumed to use TDMA 
for access, and at a particular time slot, there is only one active transmitting device in each cell. For the sequential mode, the 
interference is due to the active transmitters outside the typical cell. On the other hand, for the parallel transmission mode, since 
all the stages are simultaneously active, there is both intra cell and out of cell interferences., which is detailed in Sect. m 

Tn Sect. M we will motivate the choice of 7 < 0.5 in our setup. 
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The aggregation process can be extended to multiple stages. Each hierarchical level is com¬ 
posed of the transmitter and the aggregator processes, where the aggregator processes of all stages 
are initially determined such that they are disjoint from each other. At each stage, after the set of 
transmitters transmit their payloads to their nearest aggregators, and once the transmissions of a 
hierarchical level are completed, the transmitters are turned off and excluded from the process. In 
the subsequent hierarchical level, the aggregators of the previous stage become the transmitters, 
and they transmit their data to the aggregator process of the new stage. The aggregation process 
is repeated over multiple stages to generate the hierarchical transmission model. The process 
ends when all the payload is transmitted to the BS in the last stage of this multi-hop process, 
which we call a transmission cycle. The aggregation model will be detailed in Sects. HTT1 and HVl 

B. Wireless Transceiver Model 

We model the devices as wireless transceivers, since each device can act as a transmitter or 
an aggregator in a particular time slot. In ll28l . a generic architecture for energy-limited wireless 
transceivers is provided. The transceiver has four major building blocks. The transmit block (TX) 
is responsible for modulation and up-conversion, the receive block (RX) for down-conversion and 
demodulation, the local oscillator (LO) block for the generation of the required carrier frequency, 
and the power amplifier (PA) block for amplification of the signal to produce the required RF 
transmit power P T , where we assume the maximum transmit power is bounded and given by 
/V riiax • The power consumption in the receive and transmit paths are 

-Prxr = -Flo + -Frx + Poi Ptxr — Plo + Ptx + Ppa, (1) 

where the power consumption of LO, RX and TX blocks are denoted by Plo, -Frx and P T x, 
respectively, which are non negative constants. P Q is the receiver overhead power that is assumed 
to be constant. The PA power consumption is given by Ppa = ^ _ 1 Pr, where 77 is the PA 
efficiency, which is constant in the linear regime. The average energy cost of the transceiver is 
given by j3 R • P rxr + f3 T • P txr where f3 T = t tx /(t tx + F rx ) and /3 R = f rx /(f t x + F rx ), and t rx and F tx 
stand for the average receive and transmit times. The symbol definitions are given in Table [0 

In the proposed model, multiple devices (transmitters) send data to an aggregator (receiver), 
where each device is allocated a different time slot on the same frequency, i.e., TDMA. Therefore, 
(3t = 1 and (3r = 0 for transmitting devices, and their energy cost is proportional to P txr . For the 
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Parameter 

Symbol 

Parameter 

Symbol 

Power consumption in the receive (transmit) path 

-frxr (-^txr) 

RF transmit power; maximum transmit power 

Pr m „ 

Power consumption of the receive (transmit) block 

Prx (Ptx) 

RF receive power; mean total received power 

Pr; Pr 

Power consumption of the local oscillator (LO) 

PlO 

Path loss exponent 

a 

Receiver overhead power 

Po 

Average receive (transmit) time in one cycle 

^rx (ftx) 

Power consumption of the power amplifier PA 

PpA 

Transmission/reception time slot duration 

At 

PA efficiency 

V 

Receiver (transmitter) activity factors 

Pr (Pt) 


TABLE I: Notation. 

aggregator, (3 T = 0 and fj R — 1, and its energy cost is proportional to P rxr . Let E R denote the 
average energy required by an aggregator. Ey denotes the average energy required for all trans¬ 
missions within the coverage of an aggregator node, i.e., if N a devices transmit to the aggregator, 
E t is the sum of their average transmission energies. Hence, for a total transmission/reception 
time slot of duration At, aggregator and total transmitter energy consumptions are 

Er = E[A£(Plo + -Prx + Po)]j Pt = E[Af (N a pLo + Ptx + Ppa)], (2 ) 

where we assume N a devices sequentially transmit to the aggregator, the time slot satisfies 
At = E[N a ]i tx = f rx > f tx , and we assume that the communication delay due to processing of 
the data is negligible. Hence, transmitted data can be received within the time slot allocated and 
the aggregator can decode the received data. We also assume that the transmissions and receptions 
are synchronized. E R is mainly determined by the reception duration, while E r depends on the 
energy consumption of the PA, t tK and N a . Furthermore, whenever a device is in sleep mode, its 
transmitter and receiver modules are not active but other components, i.e., LO and RX blocks, 
are still consuming energy that justifies the scaled term N a P LO of E t in ([2]). 

We will use this transceiver model and the data aggregation strategy described to formulate 
our energy optimization problems in Sects. fTTHiTVl and analyze the SIR coverage in Sect. [V] 

III. Single-Stage Energy Density Optimization 

The main focus of this section is to model the average total energy density for a single-stage 
data aggregation scheme, which is a single hop energy model incorporating the data aggregation 
and transceiver model described in Sect. UH This model paves the way for understanding the 
multi-stage energy model to be discussed in Sect. HV] 

For the single-stage model, recall that initially a density A a < A/2 of the devices will be 
independently selected as aggregators, and a density of X u = A — A a will be the transmitters. 
Then, the set of aggregators, 'T a , collects the data from the remaining devices, T u , based on 
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nearest aggregator association, and each aggregator might have multiple devices assigned to it. 
However, multiple transmissions to an aggregator at a particular time slot are not allowed, hence 
the devices are scheduled based on TDMA. Once all devices complete their transmissions, the 
aggregators also incorporate their payloads, and then transmit the whole data to the BS. 

We assume perfect channel inversion power control, under which the received power at the 
BS from any device is unity. Then, the average total uplink power is given by Theorem Q] 


Theorem 1. The mean total uplink power of the devices in the Voronoi cell of the typical 
aggregator, i. e., the average of the total power consumption of the PAs, is given by 

A^y l 7 r \ m-v-2 


P( A.) = 


ttA u Pt 

77(7rA a ) 1+a / 2 


7 (| + l,A a 7rr c 2 ) + 


T]X 


max g—A a 7T r 2 


(3) 


where r c = (-Pr max / Pt) 1 ^ A defined as the critical distance. 


Proof: Proof is given in Appendix [A] where the final result can be obtained by incorporating 
the maximum power constraint into the mean additive characteristic associated with the typical 
cell of the access network model provided in f[29l Ch. 4.5]. ■ 


Corollary 1. If the maximum transmit power is unbounded, the mean total uplink power becomes 


lim P(\ a ) = 

P T max ->00 


kKPt r /a 1 
p(n\ a ) 1+a ' 2 V2 + 


Let N a be a random variable that denotes the number of devices within the Voronoi cell of a 
typical aggregator and its average can be obtained as 

. , V 

(4) 


E [N a ] — E^ a ^ lx„evb “ 27r ^« 


' r >0 


r dr = —, 

^ a 


where K 1 ,}^ is the expectation with respect to the Palm probability conditioned on 0 £ 4-7 
We denote the number of devices within the Voronoi cell of a typical aggregator and within 
distance d by N a (d), and its mean can be obtained as 


-A„7rr 2 


E [N a (d)] — E^, a ^ lx„GV 0 nB(o,d)) — / e 

where B(x, ||x||) is the closed ball centered at x and of radius |[x11. 


A„ 


r dr = — (1 — e 
o A a 


-\ a ird 2 


), 


(5) 


Lemma 1. The mean total received power is given by 

r* = £ (1 ■ exp( - ,rA “ r ' ) ) + 


a 


Pt rnax r (1 - -,vrA a r 2 c 


( 6 ) 






Proof: Proof is given in Appendix [B} ■ 

We now define the average energy terms as functions of the aggregator and device densities. 


Definition 1. Average total energy in a Voronoi cell of an aggregator. This is the average 
total energy consumption due to the transmissions within the coverage of the typical aggregator 
for a fixed payload per device. The average total energy in a typical cell is given by 

8v(X a ) — Ej{(X a ) + E t (\ a ). (7) 

Definition 2. Average total energy density. This is the average total energy consumption per 
unit area per unit time for a fixed payload per device, and can be found by scaling the average 
energy density per Voronoi cell by the number of Voronoi cells per area. The average number 
of Voronoi cells per unit area is given by E [N v \ = X/(X U /X a + 1) = A a . 

The dissipated energy density, i.e., the energy consumption per unit area, is defined as 

£{Xa) = X a 8v(Xa) = Xa (E R (X a ) + E T {X a )) . (8) 

Our objective is to find the optimal value of X a that minimizes the total energy consumption 
per unit area. As A a increases, the total number of aggregators and their total energy consumption 
to receive data will increase, and as A a decreases, the distance between the aggregators and the 
energy requirement of the typical transmitter will increase. We modify © as 

Er{ x a ) = E[At] (P LO + Prx + Po), Et{ Xa) = ft* [E [N a 2 ] P LO + E [N a ] P TX + P(A a )] , (9) 

where we recall f rx = f tx E [N a ] = At and E [N a 2 ] = yf + ff is the second moment of 

the number of devices per aggregatoU 

Any aggregator device aggregates data from multiple devices. Hence, the average number of 
devices per aggregator, i.e., E [N a ] = X u /X a = (1 — y)/y, is always greater than 1. Thus, the 
optimal fraction of aggregators that minimizes the overall energy density should satisfy 7 < 0.5. 

IV. Multi-Stage Energy Density Optimization 

We extend the model in Sect. m to the case where the aggregation process is repeated K > 1 
times before the data is eventually transmitted to the BS. Let E Rjk (X a (k)) and E Tk (X a (k)) be 


6 We derive E[N a 2 ] in Sect. m using an approximate model for the Voronoi cell areas detailed in (30l . 
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Fig. 1: A three-stage aggregation scheme with PPP(A = 1) device process. The aggregator processes (stage 1: □, stage 2: 
0, stage 3: *) are obtained by thinning the original PPP. and have densities A a (k) = 0.4* for stages k=l:3. Density for the 
initial transmitter process (stage 1: o) is A u (l) = A — X]fc=i ^a(k), and for the transmitter processes for later stages are 
A u (k) = X a (k — 1) for k > 2, respectively. Last stage aggregators correspond to the BSs. 

the average energies required for reception and transmission in the coverage of an aggregator 
and A a (A) be the aggregator density at stage k. The average total energy density at stage k is 

S k (X a (k)) = \ a (k)(E R , k (\ a (k)) + Er, k (\ a (k))). (10) 

The initial total density of the devices is given by A. The initial process with density A is 
independently thinned to obtain the set of aggregator processes with density A a (A) for stage k 
to form a disjoint set of aggregators for each stage. The density of aggregators at stage k is 
denoted by A a (k) = Ay fc , for k E {1,... , K — 1}. The total density of active devices that can 
be either transmitter and receiver at stage k is given by A (A) = A u (k) + A„(A). 

The transmitter device density of the first stage is found by subtracting the total density of 
aggregators from the initial density of the device process as A u (l) = A- Y^k=i A a (A). By the end 
of the first stage, the devices with a density of A„(l) will transmit their payload to the aggregator 
process with a density of A a (l). Next, at the second stage, the aggregators of the first stage form 
the new transmitting device process, i.e., A u (2) = A a (l), and these devices transmit to the devices 
forming the set of second stage aggregator process with density A a (2). The aggregation process 
can be extended to k > 2 stages in a similar manner by letting A u (k) = A a (A — 1) for k > 2. In 
Fig. |H we illustrate a three-stage model, where the aggregator process of stage k, i.e., is 

obtained by independently thinning T with probability 0.4 fc , i.e., 'If, (7c) has a density of 0.4 fc A. 

Generalizing ©, the average energy consumptions per unit area at stage k are given by 


En,k(X a (k)) — At(k)V cov (k — 1) (Plo + Pq + Prx ) (H) 


E Tyk (\ a {k)) = t tx (k)V cov (k - 1) 


E 1N„(A) 2 1 Pt^ + EIN.. 


where A t(k) = t lK (k)E [N n (A)], and V cov (k) is the joint SIR coverage probability up to and 
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including stage k at a given threshold. SIR coverage probability gives the fraction of successful 
transmissions with respect to an SIR threshold, and the actual average energy at stage k is 
determined by the fraction of successful transmissions up to stage k, i.e., the joint SIR coverage 
'Pcovik— 1). Hence, the current stage energy consumption scales with the fraction of the successful 
transmissions. For the devices in outage, no additional energy expenditure is incurred in the 
consecutive stages. In this paper, we interchangeably use A a (or A a (k)), A n (or A u (k)), N a (or 
N a (k)), and At(k) (or At) when the stage index is insignificant or clear from context. 


A. An Upper Bound on the Energy Density 


Let A a (K) = 


A a (l) A a (2) 


A ,0*0 


. Following the aggregation procedure, an upper 


bound for the average total energy consumption density over K stages is found by ignoring the 


SIR coverage probability as £u(A a (iT)) = Ylk=i^(K(k)) 


r> CO v(fc)=i,vfc 


that is equivalent to 


.k 


4.5 A u [kf 


Pi 


LO 


(13) 


£u(A„(A')) = + (A„(fc) + 3 5 

where Pc = Prx + Prx + Plo + P> We simplify (fl3l) by letting £ tx (l) = 1 as this scaling does 
not affect the result of the optimization formulation. 

The slot duration at stage k is At(k ) = t rx (k) and the duration that the transmit block is on 
is t tx (k). The relation between the average transmit times are as t tx (k + 1) = f tx (P)E [N a (/c)]. 
The transmit duration at slot k is proportional to the mean total number of bits to be transmitted 
as t tx (k) oc E nti 1 Na(i) > rii=i E [ N a(*)]- Note that N a(*)’s for i e {l,...,k - 1} are 
dependent random variables because of the correlation between the Voronoi cell areas of the 
aggregators at subsequent stages BTl . In fact, the inequality (a) follows from the observation 
that N a (i)’s are positively associated, i.e., when the values of one variable tend to increase as 
the values of the other variable increase as can be seen from Fig. [2] 

We denote the correlation between the number of devices within the Voronoi cells of the 
typical aggregators of the consecutive stages k and k + 1, i.e., the correlation between N a (/,:) 

and N a (/c + 1), by Pk,k+ i, which is expressed as 

E[N a (fc)N a (A; + 1)] - E[N a (fc)]E[N a (fc + 1)] 


Pk,k+1 — 


v/Var[N a (fc)]Var[N a (fc + 1)] 


(14) 


To the best of our knowledge, there is no analytical model in the literature for the correlation 
between the number of stages for the proposed data aggregation model. However, we can observe 
the trend of the correlation numerically. We illustrate how the correlation behaves for K = 4 
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Fig. 2: Numerical correlation over multiple stages. 


number of stages in Fig. 0 From Table [Til the average number of devices per aggregator is 
E[N a (l)] = (1 — Ylk=i ^ k )h f° r stage 1 and E[N a (fc)] = 7 -1 for stages k — 2,. .., K. As 7 
increases, E[N a (l)] decreases with a rate 1~ 2 [^ + Y^k =2 — l) 7 fc ] which is faster than E[N a (fc)] 

for k > 2 with a decrease rate of y -2 . Therefore, N a (l) dominates the trend of p lj2 and thus, we 
observe a different behavior for pi j2 compared to p i>i+ 1 for i > 1 , where the correlation is similar 
because the device and aggregator densities for higher stages are obtained with the same scaling 
parameter 7 . Correlation satisfies |p fc;fe+1 | < 0.4 for 7 e (0,0.3), which is sufficiently small 
given large number of stages. Thus, letting 7 e (0, 0.3) for analytical tractability, and assuming 
inter-stage independence among N a (z)’s, we can modify the expectation of the product as 

ftx(fc) CX TT fc 1E [Na(i)] = TT fc 1 /Aa(i). (15) 

-*- 2=1 -*-1= 1 


Remark 1. If there are K stages in total, at the last stage, all the aggregators of the previous 
stage (stage K — 1) with density X U (K) = A a (K — 1) are transmitting to the BSs with density 
Abs- Thus, at stage K, A a (K) = Abs- Therefore, the fraction of aggregators at the last stage is 

l{K) = A B s(A7 X_1 + A B s) _1 , (16) 

which is incorporated into the numerical analysis to find the minimizing aggregator fraction. 


B. Average Energy Density with an SIR Coverage Constraint 

This section mainly concentrates on the average energy consumption in the case of SIR outage. 
The average energy density of the proposed aggregator model is found by incorporating the SIR 
coverage probability of the typical transmitting device. If the received SIR at any stage k cannot 
exceed the threshold T, then the transmission is unsuccessful. Therefore, devices with a density 
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of P k {T)X u (k) will be successful at stage k, i.e., out of N a (k) transmitting devices served per 
aggregator, on average, only P k (T)E [N ;1 (/,:)] of them will successfully transmit their data, and 
the rest of the devices will not meet the minimum SIR requirement despite consuming energy. 
Therefore, the performance of the current stage depends on the previous stages. 

Incorporating the SIR coverage, the average total energy density in (fl3l) is modified as 



where V mv (k) denotes the joint SIR coverage probability for k stages and V (:ov (0) = 1, where 


we do not consider the effect of SIR coverage probability as all the devices of first stage transmit 
even though they might not be successful. In Sect. [Vj we consider sequential and parallel modes 


with the inter-stage independence assumption, i.e., the joint SIR coverage becomes 'P cov (k) = 


nt, Pi(T), where the dependence on T is inherent and omitted to keep the notation concise. 


Next, we formulate the energy density optimization problem assuming that SIR coverage 
probability of each device is unity, i.e., P cov (k ) = 1 for k 6 K}. Note that P cav (k) 

depends on the communication scheme, the network model parameters, such as device density, 
fading distribution, and the path loss, and is determined independently from the energy model. 
This section mainly concentrates on the energy model, while the SIR coverage and rate models - 
and hence, 'P CO v(^)’ s f° r various transmission schemes - will be discussed extensively in Sects. 
rvWIl In Sect. IVII1 we combine the energy results with the coverage characteristics for the 
successive and parallel modes, to evaluate the energy efficiencies of the proposed models. 

C. Energy Density Optimization Problem 

The following optimization problem minimizes the average total energy density over K stages: 


min £(A n (A')) 

A a (K) 


s.t. X u (k) = X a (k — 1), k e {2,... ,K} 


( 18 ) 


Ax(&) = t tx (k - 1)E [N a (fc - 1)], k e ( 2 ,..., K} 



where the aggregator density is X a (k) = Xy k for 1 < k < K, and X a (K) = A B s for stage I\. 


For the optimization formulation in (fl8T) . and for K > 2, we let yx = Yhk=i X k - The important 


design parameters of the total energy density are tabulated in Table [TT1 Using © and the design 
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parameters in Table [HI the mean total uplink power for different stages is given by 

’ (f + 1 , A'rvr?) + S^r«, k = 1 

P(\ a (k)) = < (f + 1, \~t k Trrl) + 2 < k < K - 1 

^ fr_i 

k = K 


77(7tA) < 

, r (f + 1, A BS 7 rr 7 ) + 


(19) 


, i?ABS 1 + “^ 2 ir “/ 2 


TPmax , — AbS 7T7* 2 

»?AbS 


Proposition 1. Combining 4771) with the constraints in 4771) /or wm't S7A coverage probability, 
we define c k ( 7 ) fo f/ze fota/ energy density at stage k E {1,..., A'}. Hence, c k ( 7 ) A given by 


Ck( 7 ) = < 


A [ (1 - 7tf) P C + 7 P(A 7 ) + [ (1 - 7*-) + if<t2s£]P L0 
A (1 - 7 x) fPc + 7 P(A 7 ‘) + [1 + || 7 - 1 ]Plo 


k = 1 


2 < k < K - 1 


( 20 ) 


Ml - 7*) [Pc + ^tP(Abs) + [1 + 


k = K 


Incorporating the constraints, and using 4771) . the objective function of 4771) is equivalent to 


K 


£{X a {K)) = Y J V co ,{k-l)c k { 1 ). 


( 21 ) 


k =1 


Similarly, the energy upper bound £\:(X a (Kj) can be minimized by evaluating 4771) at V cov (k) = 1 
\/k, and taking its derivative with respect to 7 . 


Proposition 2. c k ( 7 ) is an increasing function of k for 1 < k < K — 1, and Cr ( 7 ) is a decreasing 
function of total number of stages K. 

Proof: Note that y~ k increases with k, and P(y k X) is an increasing function of k for 
2 < k < K — 1. Using 4l0l) . it is easy to show that c k (y) < c fc+ i( 7 ) for 1 < k < K — 2 for fixed 
7 . Note also that -P(Abs) decreases, and A(Abs)/ 7^ -1 does not change with K, but 7^ _1 Plo 
decreases with K, hence it is trivial to show that cr( 7 ) is decreasing in K. ■ 

Proposition [7] implies that the energy density is higher for higher order stages. On the other 
hand, as the number of stages increases, the energy density at the last stage decreases. The 
number of stages cannot be increased arbitrarily. Next, to capture the energy consumption tradeoff 
between the stages, we investigate the effect of 7 on the total energy density. 


Remark 2. Using 4771) . we can easily observe that c k (y) is decreasing in 7 for 1 < k < K — 1. 
Note also that cr{ 7 ) is given by the following sum 


ck( 7 ) = A (1 - y K ) 


Pc + ^^Abs) + P L0 


+ A (1 — 7 x) 


4.5 A 


K -1 


-7 


LO- 


L3.5 A BS 

We observe that the first term decays with 7 as 1 — y K decreases with 7 , and P{\pf/y K 1 


is 
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k = 1 

2 < k < K - 1 

k = K 

A u (k) 

A(1 - 7 k) 

A7 *=- 1 

a 7 k - 1 

\a{k) 

A 7 

A7 fe 

Abs 

E[N a (fc)] 

(1 - iK)h 

7 -1 

A7 X-1 /Abs 

Lx(fc) 

1 

(1 

(1 — 1k)/i k ~ 1 


TABLE II: Design parameters. 

invariant to 7. The second term is not strictly a decreasing function of 7. On the other hand, it 
is strictly increasing for K > 2 and 7 E (0 ,a), where a E (0,0.5) and decreasing in K. As the 
second term is scaled by A/Abs> which is typically very high for the operating regime of M2M, 
it determines the trend of total energy. 

Proposition 3. Let y (K) * be the minimizer of the energy optimization problem in 4271 ) with the 
assumptior^ of V cm (k) = 1 \/k, and CK(y) is increasing in 7. Then, 7^)* [ s a decreasing 
function of the total number of stages K. 

Proof: See Appendix 0 ■ 

This section concentrates on the energy efficiency of M2M assuming perfect coverage. We 
now study the SIR coverage of the hierarchical M2M model for different transmission schemes. 

V. SIR Coverage Probability 

The data aggregation models in Sects. [inland [TV] do not incorporate the fact that the transmis¬ 
sions are not always successful. In this section, we generalize this to a coverage-based model, 
where the transmission is successful if the SIR of the device is above a threshold. We derive the 
probability of coverage in the uplink for the proposed data aggregation model in Sects. |]T[] and 
m We assume that the tagged aggregator and tagged devices experience Rayleigh fading, and 
we ignore shadowing. The transmit power at the typical node at distance r from its aggregator is 
P T (r) = min{P Tmax , P T r a }, and the received power is Pr(t) = gP T (r)r~ a , where g ~ exp(l). 

Orthogonal access is assumed in the uplink and at any given resource block, there is at most 
one device transmitting in each cell. Let be the point process denoting the location of devices 
transmitting on the same resource as the typical device. The uplink SIR of the typical device 

7 Note that it is trivial to extend this result for V cov {k) < 1 because 0 ^( 7 )’s are invariant to the coverage probability. 
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x <G 'IR located at distance ||x|| from its associated BS (aggregator) on a given resource block is 


SIR = 


Pr(t) 


g min { Pj 


\x\ 


■>Pt} 


E 9zl™{ P T ms . x ,PTR?}D- a ' 

ze^ u \{x} 


( 22 ) 


where R z and D z denote the distance between the transmitter aggregator pair and the distance 
between the interferer and the typical aggregator, respectively. The random variable g z ~ g is 
the small-scale iid fading parameter due to interferer 0 . 


Assumption 1. The actual distribution of R z is very hard to characterize due to the randomness 
both in the area of the Voronoi cell of the aggregator and in the number of the devices it serves. 
Therefore, we approximate it by the distance of a randomly chosen point in M 2 to its closest 
aggregator and hence it can be approximated by a Rayleigh distribution 021/ ; 

fn z { r z ) = ( r z / cr 2 )e -r ^ /2cr2 , r z > 0, a = A /l/(2vrA a ). (23) 

The uplink SIR coverage of the proposed system model is given by the following Lemma. 


Lemma 2. The uplink SIR coverage with truncated 

control and with minimum average path loss association 


jpower control: With truncated power 
the uplink SIR coverage is given by 


P(T) = pCUTPr) + C Ir (Tr°P^Jf R (r)dv, (24) 

J r c 

where p = 1 — exp ( — nX a r 2 ), R is Rayleigh distributed with parameter o = sjl) (2nX„), and 

C Ir {s) ~ exp ( -— 7 - f(l — e~ nXa P(l + TnX a r 2 )]P T C a {sP T ) 

V a — 2 V 

+ (1 - mA„-P Tm „E s , [Rl- a C a (sP T ^R -“)| R : , > r c ] )) (25) 

denotes the Laplace transform of the interference where C a (T ) — . /(l 1 — 4,2 — 4, —T 1 ), 
and 2 Pi is the Gauss-Hypergeometric function. 


Proof: See Appendix 10 ■ 

Corollary 2. The uplink SIR coverage with open loop power control (33): With open loop 
power control and with minimum average path loss association, the uplink SIR coverage is 

lim V(T) « exp ( - -^-C a {T) ). (26) 

p T max -ioo \ a — 2 J 


s In “minimum average path loss association”, a device associates to an aggregator with minimum path loss averaged over the 
small-scale fading, i.e., the aggregator has minimum product among all aggregators. 
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Proof: As P Tmax ^oo,r c = (Pt^JPt) V ° ->■ oo and p —y 1, which yields the final result. 


Corollary 3. The uplink SIR coverage is lower bounded by 


V lb (T) = P C LB (TP-) + / £F(Tr»P£Jf R (r) dr, 




LB / 


(27) 


where £/ B (s) is a lower bound for the Laplace transform of the interference and given as 




exp l — 


2 s 


(1 _ e -^(i + vrA a r 2 c ))P T C a {sP T ) 


a — 2 

p T max (vrA a )" /2 r(2 - 2/a, ttA a r 2 ) 

f-l f OO t -c/2 


(28) 


Proof: Noting that 6' (> (s) = // 


1 i r p ^ X 
l+s _ 1 t “ /2 Ufc ^ 


J/ 0 ^—rr^dt = 1 for a > 2, we 


obtain the following upper bound for the conditional expectation in step (g) of (1561) : 

(1 -p)Eh, [r?-“C 0 (sP T .„„rr)| ft > rj < (1 -p)E h . [Rl~ a \R, > rj , (29) 

where the RHS can be calculated as 

roo i 

(1-p)E Rz [Rl~ a | R z > r c ] = 1 2 v 1 - a ' 2 TT\ a e-* x ° v dv = ^ }1 _ a/2 r(2 - 2/a, vrA a r c 2 ). (30) 
From (l57l) . (l29l) and (l30l) . we can lower bound £/ r (s), which yields the final result. ■ 


A. Coverage Probability and Number of Stages 

The number of multi-hop stages K is mainly determined by the SIR coverage and the distance 
coverage , which we define as the probability that the distance between a device and its nearest 
aggregator is below a threshold. We provide bounds on K using these coverage concepts. 

Assumption 2. Interstage independence. The proposed hierarchical aggregation model intro¬ 
duces dependence among the stages of multi-hop communication since each subsequent stage is 
generated by the thinning of the previous stage. For analytical tractability, we assume that the 
multi-hop stages are independent of each o//zcv[j Hence, the transmission is successful if and 
only if all the individual stages are successful. The success probability over K stages is 

Pmy(K) = P(SIR 1 > T,..., SIR* > T) = Tf V k (T), 

where Vk(T) denotes the coverage probability at stage k. With full channel inversion and mini¬ 
mum average path loss association, the uplink SIR coverage is independent of the infrastructure 

? This assumption is required for the transmission modes described in detail in Sect. [YU and is validated in Sect. [yin 
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density as given in ( f26i) . For the case Pj — > oo, since ViJT) in ( l 26 l ) A also independent of 
the device density, and only depends on the threshold T and path loss exponent a, is identical 
for all stages, and denoted by V{T). 

Lemma 3. An upper bound on K ( Sequential mode). Given a minimum probability of coverage 
requirement V cov (K) > 1 — £ and an SIR threshold T, the number of stages is upper bounded by 


Kv = 


log (1/(1 -e)) 


and 


lim Ky = 


log (1/(1 -e)) fa- 2 


(31) 


log(ma x k V k (T)) ’ p Tmax ^°o TC Q (T) V 2 

Proof: The upper bound is obtained by combining (1241) with the condition V cov (K) > 1—e 
and using the relation V cov (K ) = \\ff = , Vif T) < max fc Vk{T) K . ■ 

In addition to the SIR outage, since the devices are randomly deployed, any device will be in 
outage when its nearest aggregator is outside its maximum transmission range. Thus, we also 
aim to investigate the minimum number of required stages given a distance outage constraint. 
A lower bound on the optimal number of multi-hop stages is given by the following Lemma. 


Lemma 4. A lower bound on K. The number of stages is lower bounded by 


AT = 


E[L(A 0 )]E 


r 1 


N a 


R u 


Pt 


(32) 


where L(X„) denotes the total length of the connections, N a is the number of devices in the 
Voronoi cell of a typical aggregator, Pt max is the maximum transmit power and T > ri, nin is the 
minimum detectable signal power at the receiver. 

Proof: The idea of the proof is similar to the proof of Theorem Q] (see Appendix |A]), where 
f{x) = 11x 11. The mean total length of connections, in the Voronoi cell of an aggregator equals 


E[A(A a )] = \ u / f(x)e 


—A 0 7r||a:|| 


dx = 2n\. 


r e 


dr = 


A,, 


i 3/2 ' 


(33) 


Jo 2A a 

Given a maximum transmitter power constraint, /V lllflx for each device, the maximum transmission 
range is given by (A Tmax /P Rm .J 1//Q . Dividing L( X a ) by N a and takingits expectation with respect 
to the distribution of N a , we obtain the mean length of connections and dividing this ratio by 
the maximum transmission range, we obtain the desired result. ■ 

The following Lemma provides a lower bound for K L that is based on the mean total length 
of the connections to the typical aggregator and the fraction of aggregators. 


"For tractability, we take expectation over a PPP first to find E[L(A a )], and then multiply it by E N)- assuming independence. 
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Lemma 5. K L is lower bounded by 

1 (1 ~ 7) f ^Rmin 

E [N a ] 2AV 2 7^/ 2+1 VP t _ 



< K l < K v . 


Proof: Using (1331) . we can lower bound K h as 


(34) 


Kl> 


f 1 

^ U 

fP R . x 1 /"! 

Jl min \ 

E[N a ] 

2\P 

KP T ) 

max 


where we use the convexity of 1/N a , i.e., E 
of K, we have the relation E[L(A a )] = - 


> _L 

— E[N £ 


r. Noting that X u and A Q 


_ A 7 x ~ 1 (1-7) _ (1-7) 


K since 7 < 0.5. Plugging this relation into 


2(\yKf/2 2A 1 / 2 7 if / 2 +l 

, we get the bound. 


, which is 


(35) 

are functions 
increasing in 


VI. Transmission Rate Models 

For an interference limited network, the rate of the typical device is given by Rate = 
3k log (1 + SIR), where W is the total bandwidth of the communication channel, and N a is the 
load at the typical aggregator. The average number of devices served by the typical aggregator 
is denoted by E[N a ] = jf. Rate coverage is defined as rate exceeding a given threshold, i.e., 

OO 

P(Rate > p) = (SIR > 2^ - l|N a = /) P Na (/), (36) 

1=0 

where PN a (0 is the probability mass function (PMF) of N a , and is given by the following Lemma. 


Lemma 6. The PMF of the number of devices served per aggregator of stage k, i.e., N ;1 (/;■), is 

3.5 3 - 5 (X u (k)/X a (k)Y r(3.5 + 0 


F N a(fc)(0 = 


^N a (fc)(°) 


n ( 3.5 + (x u (k)/x a (kW-^ r(3.5)r(z +1) ’ 

where X u {k) and X a {k) for k > 1 are given in Tabled 


(37) 


Proof: Normalized distribution function of Voronoi cell areas in 2D can be modeled by If30l 
as f(y) = r (3 5 ) y^ e ~^ V • Using the densities of the transmitters and the aggregators of stage k, 
the probability generating function (PGF) of the stage k devices in the random area y is |[34ll 


G^ (k) (z) = E[* n *«] = E[exp ((X u (k)/X a (k))y(z - 1))], (38) 

where conditioned on y, the PGF is of a Poisson random variable N a (k) with mean (A u {k) /A a (k))y. 

Combining (l38l) and the pdf f(y), we obtain the PGF of N a (/c) as 
'A u (k) 


Un 4 k) (z) = 


exp 


fX u (k) \ 3.5 3 ' 5 5 _ z 

2 iy = 


3.5 


r(3.5)' 


3.5 

(3.5 + (1 - z)X u {k)/X a (k)) 3A 


, (39) 


Then, the PMF of N a (fc) is recovered by taking derivatives of G. 























20 


The key assumption in our analysis is that there is one active device per resource block in 
each Voronoi cell. Using (l37l) and X u (k)/X a (k) = E[N a (/c)], the probability of not finding any 
device in the Voronoi cell of the typical aggregator at stage k is 


P NaW (0) =G Na(fc) (0) =3.5 3 ' 5 (3.5 + E[N a (fc)])- 3 - 5 . 


(40) 


The number of devices N a (/c) served by the typical aggregator is mainly determined by the 
fraction 7. When 7 is high, i.e., E[N a (/,:)] is low, the probability that there is no transmitting device 
within the Voronoi cell of the typical aggregator is not negligible. Let pth(k) be the probability 
that there is at least a device in the Voronoi cell of the typical aggregator in the k th stage. 
Therefore, the interference field of stage k is thinned by p t h(k), and the effective density of the 
interfering devices at stage k is pth{k)X u (k). Using (l40l) . pth(k) = 1 — 3.5 3 - 5 (3.5+ E[N a (/c)]) -3 ' 5 . 
From Table [Til we can see that pth(k) is the same for 1 < k < K, and different for k E {1, Kj. 

Note that in evaluating the average transmitter energy consumption in (fl2l) . we also require 
the second moment of the number of devices served per aggregator, which can be calculated 
adding the first and second derivatives of PGF of N a (/,:) in (l39l) evaluated at z = 1~ as follows 



(41) 


Conditioned on having at least one active device per resource block in each Voronoi cell, 


E[N a (fc)|N a (fc) > 0] =p th (fc)~ 1 E[N a (fc)] and E[N a 2 (fc)|N a (fc) > 0] =p th (A:)- 1 E[N a 2 (fc)]. 


The uplink SIR coverage for successive stages in (l26l) is independent of the device density and 
Pth(k). However, the rate coverage results for the parallel mode depend on the device density, 
and the interference field of stage k is thinned by p t h{k), as described in Sect. IVI-B1 
We consider two main transmission protocols, namely i) a successive transmission protocol 
where the stages are activated sequentially, i.e., a half-duplex sequential mode, and ii) a parallel 
transmission mode, which is either a full-duplex protocol where all stages are simultaneously 
active, or a half-duplex protocol with alternating active stages. We investigate their energy 
efficiencies and provide numerical comparisons for the rate-energy-delay tradeoffs in Sect. IVIII 

A. Rate Distribution for Successive Stages 

In this mode, each transmission cycle consists of the stages operating in succession. Stages 
may not be repeated before a cycle is completed. This mode may provide low rate, but it has 
low interference since multiple stages are not active simultaneously. Let K. — {1,..., K} denote 
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the set of stages and 1Z = {Ratei,..., Rate^} be the set of rates in terms of the total number of 
bits transmitted per unit time. Since the total number of bits that can be transmitted per K units 
of time is min fee £ Rate*;, the transmission rate in successive mode is Rs = K l min fee x: Rate*;. 

Remark 3. Dependence of hierarchical levels. The hierarchical levels are not independent 
from each other as detailed in Sect. |7V] and hence, it is not tractable to analyze the joint rate 
distribution for successive stages. Instead, we define the rate outage as in (1 42]) where transmission 
rates are assumed independent. Without tracking the path of the bits (payload) transmitted, we 
only consider if the hierarchical transmission process is successful. Transmission from a device 
is successful if its payload is delivered to the BS at the end of K stages. We give a comparison 
for the analyses with the independence assumption and the monte carlo simulations in Sect. IV7/I 

With the independence assumption in Remark (3} the rate coverage for successive stages is 

OO 

P(Rs > P) — H P(Rate fe > Kp) m n£ P fc (2^-l)P Na(k) (/). (42) 

kcK. kcK 1=0 

If we let P Tmax —>■ oo, the rate coverage is simplified to 

oo 2^4 i K j 

lim P(Rs > p) ~ nE«P- - i)K.o.)(0- 

Tmax kcK. 1=0 2 

B. Rate Distribution for Full-Duplex Parallel Stages 

In full-duplex parallel mode, transmissions are not interrupted during a transmission cycle 
unlike the successive transmission mode. All multi-hop stages operate in parallel, the 1 st stage 
devices only transmit, and the rest of the devices both transmit and aggregate simultaneously, 
during all stages of the multi-hop transmission. Therefore, this mode is a full-duplex model. 

Due to the simultaneous transmissions at all levels of the hierarchical model, the interference 
at each stage is due to i) the interferes of that stage, i.e., the intra-stage interference, and 
ii) the remaining transmitting devices of the other stages, i.e., the inter-stage interference. The 
hierarchical levels are determined in the same manner similar to the successive mode, and also 
dependent in this mode, and hence, the inter-stage interference is correlated. Although full- 
duplex parallel mode offers high transmission rate compared to successive mode, it has higher 
interference since all the stages are active. The intra-stage interference in the parallel mode can 
be obtained in the similar manner as in the successive mode. The following lemma provides the 
analytical expressions for the Laplace transforms of intra-stage and inter-stage interference. 
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Lemma 7. Given the active transmission stage k, the Laplace Transforms of the intra-stage 
interference and the inter-stage interference are given as follows: 

(a) The Laplace transform of the intra-stage interference at stage k is 

2s 


C Ik (s) -exp - 


a — 2 


(1 _ e -Af (fc)r ?(1 + nX f(k)r 2 c ))P T C a (sP T ) 


+ (l-p k ) 7 r xf(k)p Tm JZ R 


R 2 ~ a C n 


sP T 


R 


Zk 


R^k ^c 


, ( 43 ) 


where p k = 1 - exp(-7rA a (fc)r;?) and A f(k) = p th (k)X a (k). 

(b) The Laplace transform of the total inter-stage interference from stages {l \ I £ /C. / f k) is 

^ , 2 sPn 




: W « n “P ( - (! - + ttA f(I)r?))(B a (sP T ) + f-jC a (sP T )) 

l£k c 


(l- Pl )nXf(l)E R 


K ( B a 


sPr 


R a 

1L Z t 


2sP T 


-C n 


sPr 


R a 

1L Zl 


Rzi r c 


, ( 44 ) 


(a - 2)i?“ 

where B a (s) = 2 i 7 i (l- '^,1 + j-, — 4J is obtained using the Gauss-Hypergeometric function. 
Using the Laplace transforms, the uplink SIR coverage V k {T) can be found using 


Proof: See Appendix [E| 


Corollary 4. The Laplace transform of the total inter-stage interference for Pj 


—> oo is 


lim £i k c(s) « exp 

p T max ->-oo 


B a (sPj 


2 sPn 


a 


■C a (sP 7 


(K- 1) 


( 45 ) 


Lemma 8. The uplink SIR coverage probability for the full-duplex parallel mode is given by 

— / /*oo 

P(SIRp > r) = n fCh(TPT )C,„ATP7) + / C h (Tr°PpJC, tc (Tr°PpJf Rk (r)dr),(46) 

Ur ir J r c . 


keic 


where f Rk (r ) = ( r/a 2 k )e r2/2a2 for r > 0 and a k = 1 /(2 ttX a (k)). 


Proof: The proof is similar to that of Theorem I in lf33l . However, instead of one serv¬ 
ing stage, the serving stages change sequentially. Result follows from the evaluation of (1431) 
using (1241) . Since the total inter-stage interference in ([59b is independent from the intra-stage 
interference, its Laplace transform is incorporated as a multiplicative term. ■ 

Using ([26b and (1451) . the uplink SIR coverage for / : V ni;ix —>■ oo is given by 

P(SIR P > T) « J] exp ( - — ZTjCa{T)j exp ( - B a (T) - 

k£lC a a 


( 47 ) 
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Lemma 9. The rate coverage probability for the full-duplex parallel transmission mode is 

OO 

P(Rp > p) = JJ F(Rate fe > p) — ns P(SIR, > 2# - l|N a (fc) = /)P Na( k)(0- (48) 

k£lC keic 1=0 

Proof: Proof follows from the combination of (l46l) with the definition of rate. ■ 

The rate coverage expression for the full-duplex parallel mode is validated in Sect. IVIII 

Remark 4. Although the full-duplex parallel transmission strategy is probably not feasible for 
M2M communication, it is helpful to have a comparison of the rate-energy tradeoffs of both 
models, and provided for completeness. The models described above, i.e., the sequential mode 
which is half-duplex by design, and the full-duplex parallel mode, are the two principle design 
schemes that mainly differ in terms of their total rate coverages and energy consumptions. 

Next, we introduce the half-duplex parallel transmission, which is a feasible scheme for M2M. 


C. Rate Distribution for Half-Duplex Parallel Stages 

The full-duplex parallel mode can be transformed into a half-duplex communication scheme. 
In this mode, at a particular time slot, only the even or odd stages are active. Analysis of this 
model is quite similar to the parallel mode analysis, using only the active stages when calculating 
the inter-stage interference. The SIR coverage of the half-duplex mode can be characterized in a 
fashion similar to full-duplex mode SIR coverage in (l46l) . and is given by the following lemma. 


Lemma 10. The uplink SIR coverage probability for the half-duplex parallel mode is given by 


PC 

P(SIRp>T)« n {p^ h (TP T 1 )C,pTP T ) + 

Us- 1r~ J Vc. 


^(Tr a P^)C hc (Tr-P^Jf Ri (r) dr .(49) 




where calculations of X'f(k) and the distributions of {R Zk } in (| ~43§ and 091 ) are done over the 
set of active stages and /C Q dd denotes the set of active stages, i.e., the odd stages. 

The uplink SIR coverage for Pr max —* oo is given by 

P(SIR P > T) J] eX p(--—C a (T))exp(-B a (T)--—^C a (T)y . (50) 


For the half-duplex mode, the rate coverage expression in (l48l) changes in accordance with 
(l49l) . which can be found by following the steps in Lemma [9l Since only half of the stages are 
active simultaneously, the overall rate is half the rate of the active stages. Thus, to achieve a 
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Fig. 3: Number of hops versus the total SIR outage proba¬ 
bility constraint, e for variable T. 


Fig. 4: Total energy density, with optimal fraction of aggre¬ 
gators, 7 0 pt, marked for Plo = 5mW (28]. 


rate threshold of p, the rate threshold for the active stages should be set to 2 p. Note also that 
p t h(/c)’s are modified in accordance with the updated values of E[N a (/c)]. 


D. Expected Communication Delay 


Average communication delay depends on the number of hops in the transmission and the 
communication rate. Using the bounds (OTT) and ([32b on the total number of hops K in Sect. 
IV-A1 and denoting the delay of the typical device due to the transmission of a payload of M bits 
at rate R over K stages by V(R,K), it satisfies V(R,K) = t(R,K ) — t(R, 1), where t(R, 1) 
is the direct transmission duration as a function of the transmission rate R in the unit of bits/s 
and the payload M in the unit of bits, and it is given as t(R, 1) = MR _1 , and t(R, K ) is the 
transmission duration over K hops. We calculate the average transmission delay conditioned on 
the devices that satisfy the minimum rate threshold p over K hops, which is obtained as 


E [V{R, K)\R> p] = E [t(R, K )| R > p\ — ME 


r 1 
R 


R > p 


(51) 


where E [±| R > p] = f Q 1/p 1 - P (R > ±) 


-i/p 


1 — V{2wt — 1) dfP Na (/), which 


& - E/o 

i=i 

can be calculated using the rate distribution given in ([36b . 

The expected delay depends on the transmission protocol. As discussed in Sect. [VT1 we 
mainly focus on two different transmission protocols, i.e., successive and full-duplex parallel 
transmission modes as described in detail in Sects. IVI-AI and IVI-Bl respectively. 

Successive Transmission. Using the distribution of the sequential mode transmission rate Rs 
given in (l42l) . the expected communication duration for K stages can be obtained as 


E[t(i2 s , A')|i2 s > p\ « MK 


"!/P r 


nE”„ p 42&-i)PN. W (() 


dt. 


keic 


(52) 
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Fig. 5: Total energy density, with optimal fraction of aggre¬ 
gators, 7o P t, marked for Plo = Prx /4 (28l l. 



Fig. 6: Rate coverage distribution with maximum power 
constraint Pr max /PT = [1, 5,10, 20], 


Parallel Transmission. The distribution of the full-duplex parallel mode transmission rate Rp 
is given in (1481) . Therefore, the expected communication duration is obtained as 


E \t(Rp, K ) I R P > p] 


A/T TP 


-1/p 


oo 


keJC 1=0 

where P(SIRfc > 2 wt — 1) can be calculated using (l46l) . 

The expected delay for the sequential (or full-duplex parallel) mode can be found by combining 
(l52l) (or (1531) ) with the delay expression in (I5TT) . 


VII. Numerical Results 

For the performance evaluation of the aggregator-based M2M communication model in Sects. 
IfflHTVl and the rate coverage models in Sect. [VH we incorporate the power-dissipation factors of 
IEEE 802.11b (WiFi) technology ll28il . The simulation parameters are shown in Table Hill 

Optimal number of hops. We illustrate the trend between the number of required stages 
versus the outage probability constraint for different thresholds T = {0.01, 0.1,1} in Fig. 0 For 
low outage probabilities, i.e., small £ values, Kp should be also low as the average transmission 
range is long. As £ increases, the maximum transmission range decreases and Kp increases. The 
variation of the lower bound on the number of hops, Kp, for varying £ and Kp is also shown. 

Optimal fraction of receivers decay with the number of stages. We evaluate the performance 
of the energy model in Sects. [111] and [TV] for K e {1,..., 10}. In Figs. BE we observe the 
variation of the average total energy density with respect to 7 for different Pp 0 , where E D (solid 
line) corresponds to the energy density of direct transmission to the BS. The optimal values of 
7 that minimize the total energy density, i.e., 7 opt (marked), are decreasing with K. 
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The simulation setup for the verification of analytical rate models developed in Sect. [VI] 
is as follows. Device locations are distributed as PPP over a square region of size 5 x 5 sq. 
km. Note that the density of aggregators at stage k is Ay fc , i.e., the number of aggregators 
decay geometrically. Therefore, for high K values we need a region with much larger area 
for the validation of the model, but scaling the region increases the computational complexity 
exponentially. Therefore, we restrict ourselves to K — 1 : 3, which is also consistent with the 
plot characteristics on the number of hops illustrated in Fig. [3J and investigate the performance 
of the proposed model. Furthermore, we approximate (l42l) by truncating N a over the range 
/ = 0,1 ,...,L, where L is set to 20. This is a reasonable range as the fraction of aggregators 
is chosen to be 7 = 0.1, i.e., there are 9 devices per aggregator on average. 

Rate coverage for half-duplex and full-duplex modes. The rate coverage for a single- 
stage model for different maximum power levels is shown in Fig. [6] As Pt„ 1!ix / Pt increases, 
coverage is improved for thresholds less than 10 4 bps. For the rest of the simulations, we assume 
/V max -A 00 for tractability. The rate coverage for the sequential and the full-duplex modes for 
different number of hops are illustrated in Figs. [7] and [8} The analytical results do not exactly 
match the simulations, and the difference is due to the interstage independence assumption in 
Sect. IV-Al and the independent power control assumption in lf32l . which is the main limitation of 
this work. Comparing Figs. [7] and [8} we observe that the full-duplex mode does not offer higher 
rate coverage compared to sequential mode, which is due to inter-stage interference as detailed 
in Sect. IVI-BI Thus, in terms of rate coverage, sequential mode is preferable over full-duplex 
mode. From (l50l) and (l47l) . the half-duplex parallel mode has higher coverage than full-duplex 
mode for the same K. Due to limited space, we do not provide an illustration for that scheme. 

Rate-energy tradeoffs. We compare the average total energy density for the sequential and 
full-duplex parallel schemes in Fig. [9J The rate coverage decays as the hop number increases, 
which reduces the chance of successful transmissions, and the overall energy consumption. To 
make a relevant comparison based on the energy consumptions of sequential and parallel modes, 


Parameter 

Value/Range 

Parameter 

Value/Range 

Total bandwidth 

W = 10 b Hz 

Power consumption of RX (TX) 

P RX = 200 (Ptx = 100) mW 

Path loss exponent 

a~ 4 

Network size 

2 R x 2 R sq. km, R = 2.5km 

Device (BS) density 

A = ICE (Abs = 1) per sq. km 

Payload 

M = 100 bits 


TABLE III: Simulation parameters. 
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Fig. 7: Rate coverage distributions in sequential mode for Fig. 8: Rate coverage distributions in full-duplex parallel 

K = {1, 2,3} stages. mode for K = {1, 2, 3,4} stages. 

we consider multiple transmission cycles equal to the number of stages, i.e., K = 2. On the plot 
legend, S P and £5 stand for the total energy densities calculated based on the actual rate coverage 
probabilities, and Q P and Qs stand for the rate outage probabilities, i.e., complements of the rate 
coverage probabilities, of the full-duplex parallel and sequential transmission modes, respectively. 
In full-duplex mode, all devices are always active and transmitting, but the communication rate 
is low due to higher interference, hence, its energy consumption is less than the successive 
mode. We also investigate the average total energy density for the half-duplex parallel scheme 
with K = 2 in Fig. OH In the figure legend, HD stands for half-duplex. This scheme has higher 
coverage probability as only half of the stages are operating simultaneously, and higher transmit 
distance as the aggregator to device fraction is 7 2 <7 7, which yields higher energy consumption. 

Different transmission modes have revealed the tradeoffs between the coverage and the energy 
requirements. Full-duplex mode has low energy density, but low rate coverage and high delay. 
Sequential mode also has low energy density, high rate coverage and low delay. On the other hand, 
half-duplex parallel mode has higher energy density (x 2 ) and higher rate coverage compared to 
full-duplex mode. Unlike the full-duplex mode, which is not convenient for M2M communication 
despite being energy efficient, the half-duplex parallel mode is favorable with higher energy 
consumption. However, its energy consumption can be reduced by readjusting the aggregator 
fractions for alternating stages. Since M2M is delay tolerant, sequential mode is also feasible 
and a low-cost technique. Considering the operating regime for M2M devices and the simplicity 
of their design, sequential mode is preferable as it has low energy density and high coverage. 
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Fig. 9: A rate outage-total energy density tradeoff for 
sequential and parallel modes, K = 2. 


Fig. 10: A rate outage-total energy density tradeoff com¬ 
parison of half-duplex parallel model for K = 2. 


VIII. Conclusions 


We study a general multi-hop-based uplink communication scheme for M2M communication, 
and develop a new data aggregation model for M2M devices, using tools from stochastic 
geometry. To the best of our knowledge, this is the first work on power-limited communication 
providing a unified energy consumption model with coverage in cellular networks. Our results 
show that the uplink coverage characteristics dominate the trend of the energy consumption for 
the proposed transmission modes. Considering the operating regime of M2M devices, sequential 
and half-duplex parallel modes are more feasible compared to full-duplex mode. 

Interesting extensions would include the minimization of the energy expenditure through joint 
optimization of the optimal number of multi-hop stages and fraction of aggregators. Better energy 
models can be developed to incorporate the different states of the transceiver, i.e., on, idle, sleep 
and off states, which will provide a more accurate energy model for the proposed communication 
scheme. Strategies for synchronization of transmissions is also important to prevent multi-hop 
delays and save receiver energy consumption. Multi-slope path loss models ll35l would more 
accurately model the effect of interference and noise for networks with increasing device density. 


Appendix 


A. Proof of Theorem [Tj 

Let = E Jx n , = En^W , and Vq be the Voronoi cell of the typical aggregator 
X 0 e 'ha located at the origin. Notice that x e V 0 if and only if 'V a (B°(x, ||x||)) = 0, where 
B°(x, H#||) is the open ball centered at x and of radius ||x||. Hence, 
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which is the expectation with respect to the conditional (Palm) probability conditioned on 0 E 'ha. 
As 'P n is PPP due to thinning, from Slivnyak’s theorem ||29ll , the RHS can be rewritten as 


/ /(x)P° o [tf a (B°(x, ||x||)) = 0 ] dx = / /(x)P [tf a (B°(x, ||x||)) = 0 ] dx. 

Jr 2 Jr 2 

Using the Poisson law, we have E^ o £) f(X n )lx n ev 0 = / R 2 / 0 r)e _Aa7r|N|2 dx. We let f(x) = 

n 

inin{Pr max , Pt ||x|| a } be the transmit power with a maximum power constraint /V max . Then, we 
have the following final result for the mean total uplink power of the devices in the Voronoi cell 
of the typical aggregator, i.e., the total power consumption of the PAs: 


P{ K) = 


27tA u Pn 


r a e -\ a rrr 2 r dr + 


V 

where r c = (f :> r,„ HX / Pi) ' A \ and the final result follows from employing a change of variables 
u = X a nr 2 , and y(s,x) = J 0 l f s_ 1 e _t df is the lower incomplete gamma function. 


2 n\ u P Tm 
V 




r dr, 


B. Proof of Lemma [7] 

The average received power from a device at distance x from the typical aggregator is P R {x ) = 
min{Pr max ||x|| _a , Pt}- The average number of devices within distance r c = (Pr intix /Pr) ' n of 
the typical aggregator is E [N a (r c )] = ^ (1 — exp (—7rA a r 2 )). Using the Poisson law, the mean 
total received power is given by 

P POO 

Pr = E°r P R (X n ) ljf neVo = A u / P R (x)e~ x ^ x II 2 dx = 27rA u / P R (r)e~ x ^ r2 r dr, 

„ Jr 2 Jo 

where the final result follows from incorporating P R {x) to split the integral into two parts, 
employing a change of variables A a 7rr 2 = u and the definition of incomplete Gamma function. 


C. Proof of Proposition 0 

Note that £(X a (K)) = Ylk=i c k(l)’ where c k { 7 ) is given in Proposition Q] Note also that 
min{£(A a (A'))} + uiin{c R - + i( 7 )} < mm{£(X a (K + 1))}. Adopting 7 ^ to represent the ratio 
for K stages, and letting 7 ^* be the optimal value for K stages, optimal solutions of £(A„(A")) 
and £(X a {K + 1 )) are 7 ^)* — arg min£(A a (A”)) and y( A+1) * = arg min £(X a (K + 1)), yielding 

k=1 c kin {K) *) + min{c^ + i(7)} < c k{l (K+1) *)- (54) 


It is also clear that 




( 55 ) 
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Using (1541) and (l55l) . we have miii{c ^ +1 (7)} < c K+ i( 7 ^ x+1 ^*). Assume that -y( A+1 )* > yW*. 
Then, from Remark [2] cy(y 7A7 *) > c^(^( A+1 )*) for k < K — 1, and c R ( 7 ^*) < c^y^ 447 *). 
Using (|55T) . and from 7 ( A '+ 1 )* > y( A )*, we can infer that CK + i(y 7A+1 )*) is increasing in 7 ( A '+ 1 )*, 
and reduce ck + i( 7 7a+17 *), i.e., the energy density at stage K+ 1, by decreasing y 7A+l7 * to y 7A7 *, 
which implies y( A+1 )* ^ argmin£(A 0 (A" + 1)), giving a contradiction. Thus, y 7A '+ 1 )* < yW*. 


Z). Proof of Lemma [2] 

The Laplace transform of the interference can be written as 


£,„(s) ( = E 
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(1 _ e -^(l + vrA y c ))P T C a {sP T ) 


+ (l-p)irA a F Tn ,„E R , [fl?-“C a ( S P Tm ««:“)|fl,>i-J jj, (56) 

where (a) follows from the iid nature of {g z } and the independence of { R z } (see Assumption [U). 
The process T,, is not a PPP but a Poisson-Voronoi perturbed lattice and hence the functional form 
of the interference (or the Laplace functional of \L U ) is not tractable [|33l . Authors in |[33l propose 
an approximation to characterize the corresponding process as an inhomogeneous PPP with inten¬ 
sity measure function A u (dy) = 2ii\ a y(dy). Hence, ( b ) follows from the definition of probability 
generating functional (PGFL) of the PPP lf27l . the independent path loss between the device and 
its serving aggregator ll33ll . i.e., R z ’s are independent, and the assumption that interferer’s path 
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loss is bounded as R z < y a ll33Tl . (c) follows from the maximum power constraint, where 
p — 1 — exp ( — irX a rf) denotes the probability that R z < r c and can be obtained using © and 
©, ( d) follows from change of variables t = ( x/R z ) 2 , (e) follows using the definition of Gauss- 
Hypergeometric function, yielding /” 1+J ,- lt a/a dt = ^ 2^1 (l, 1 — ^,2 — f, -s) = ^C^s), 
(/) follows from Assumption |T| for Hence, R 2 is exponentially distributed with rate parameter 
l/(2cr 2 ) = 7rA a that yields 


pE Rz [R 2 z \R z <r c ] = / vnX a e nXaV dv = 


1 — e 7rA “ rc (1 + 7 rA a r 2 ) 


JO 

Conditioned on the distance between the device and its associated aggregator, 


(57) 


P(SIR > T\r < r c ) = E/ r P (g > TP t I r \r < r, 


r—1 


ZiriTPr), 


P(SIR > T\r > r c ) = E /r [P (g > Tr a P~'j r \r > r c )] « C lr (Tr a P^J. 

Hence, the uplink SIR coverage is obtained as V(T) ~ pC Ir (TP T 1 )+ J' x C Jr ( Tr°P-} t l ]y ) fu ( r ) dr, 
where R is Rayleigh distributed with parameter cr = \Jl/ (27rA„) from Assumption Q] 


E. Proof of Lemma [7| 

Part (a). The intra-stage interference can be found using the density of active receiving devices 
of stage k, which is given by A® ff (/c) = Pth(k)X a (k). The Laplace transform of I k is 
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exp 


exp 
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2s 


V g„ min { P T ,P T R a \D 
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(1 _ e -^Af {k)rl^ + vrA f{k)r 2 c ))P T C a (sP T ) 


(58) 


+ (1 -Pk)*\f(k)PT,A, k [Rl; a C a ( S P Tim R ;°)I R Zk > r„] )). 

This result is similar to the Laplace transform of the interference in (l25l) . where the distribution 
of R Zk is Rayleigh, but with parameter o k = \/1 /(2 ttX a (k)) and p k = 1 — exp(— nX a (k)r 2 ) is 
the probability that R z k < r c that decreases with increasing k. 

Part (b). Let I k c be the total inter-stage interference at stage k, and \I/ U)fc c = be the 

point process denoting the location of inter-stage devices transmitting on the same resource as the 
typical device. Note that the transmitter processes for all stages are determined by independent 
thinning of the initial device process T modeled as PPP Hence, x \i u k c is the superposition of the 
inter-stage transmitter processes, and is a PPP with density A® ff (/P) = ]C,yfcPth(j)A n (j), and 
is also independent of since the random variables { R Zk } are assumed independent 
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Then, the Laplace transform of I k c, i.e., £ IkC (s), is given by 
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where R Zl is Rayleigh with parameter 07 = \/l/(27rA a (/)), pi — 1 — exp(—7rA a (/)r 2 ) is the 
probability that B z j < r c , and (a) follows from the independence of { R Z[ } over stages {/}. 
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